Rank in Wordlist | Frequency | Word |
---|---|---|
10712 | 17 | COVID-19 |
11257 | 17 | 코로나-19 |
12087 | 15 | Wi-Fi |
15943 | 12 | 코로나바이러스감염증-19 |
24336 | 7 | X-ray |
27655 | 6 | 2020-2021 |
27872 | 6 | V-리그 |
31959 | 6 | 한-미 |
31960 | 6 | 한-중 |
32261 | 5 | 1-0으로 |
32358 | 5 | 2-1로 |
32599 | 5 | COVID-19의 |
32625 | 5 | F7U050-패드 |
32658 | 5 | K-IFRS |
32659 | 5 | K-POP |
32669 | 5 | LPH-6112 |
32689 | 5 | One-Stop |
34638 | 5 | 미-중 |
38216 | 5 | 코로나-19로 |
39523 | 4 | Covid-19 |
Rank in Wordlist | Frequency | Word |
---|---|---|
68121 | 2 | 080-3000-5000 |
88322 | 2 | 북-중-러의 |
114355 | 1 | 010-3308-5995 |
114356 | 1 | 010-3502-5870 |
114357 | 1 | 010-6832-2164로 |
114358 | 1 | 010-9636-7428 |
114359 | 1 | 010-9일3일-0이5구 |
114370 | 1 | 02-1599-1389 |
114371 | 1 | 02-2113-8515로 |
114372 | 1 | 02-2220-0875 |
Rank in Wordlist | Frequency | Word |
---|---|---|
114405 | 1 | 031-299-0511-5 |
123558 | 1 | 3-3-0-0 |
125768 | 1 | 4-2-3-1로 |
133846 | 1 | C-1‧C-2‧C-3블록에 |
137761 | 1 | Jobs-To-Be-Done |
145975 | 1 | staking-as-a-service |
152479 | 1 | ‘창업-성장-회수-재도전’에 |
159369 | 1 | 가디언그랄---울트라레어 |
191377 | 1 | 기사----- |
193004 | 1 | 기획-기술개발-실증-양산까지 |
Rank in Wordlist | Frequency | Word |
---|---|---|
191377 | 1 | 기사----- |
237244 | 1 | 박근혜-이재용-김재열-최순실-장시호 |
252475 | 1 | 분석-검역-감시-대응-진단-위기 |
275397 | 1 | 손학규-정동영-이해찬-유시민-한명숙 |
324662 | 1 | 인-권-센-터-space-규-정-이 |
328938 | 1 | 임영웅-영탁-이찬원-정동원-장민호-김희재가 |
347312 | 1 | 제출-주민투표-투표발의-투표-특별법통과-출범 |
394719 | 1 | 한국전기연구원-재료연구소-한국세라믹연구원-국방기술품질원-한국생산기술연구 |
Some languages allow the formation of longer word by composition using hyphens. Moreover, proper names may contain hyphens. Therefore we look for the most frequent words containing 1, 2, 3 or 4 hyphens.
Usually we find interesting words. But in the case of poor preprocessing there may be unexpected strings resulting from hyphenation etc. Words ending with an hyphen are usually not welcome, too.
For three hyphens:
select w_id-100,freq, word from words where word like "%-%-%-%" limit 10;
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots
3.12.4 Words containing special characters